Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures.

نویسندگان

  • Peyton Greenside
  • Maureen Hillenmeyer
  • Anshul Kundaje
چکیده

Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model's predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P-31: The Alteration of SpermatogenesisHas A Correlation with Sertoli Cell Mitochondrial Abnormal Morphology in Cytotoxicity of Testicular Tissue Mediatedwith Monosodium

Background: Male infertility has many causes, including genetic infertility. The NOP2/Sun domain family, member7 (Nsun7) gene, which encodes putative methyltransferase Nsun7, has a role in sperm motility. The aim of the present study was to investigate the effect of the T26248G polymorphism on Nsun7 protein function and its role in male infertility. Materials and Methods: Semen samples were col...

متن کامل

P-30: The Effect of The T26248G Polymorphism on Putative MethyltransferaseNsun7 Protein Function and Its Role in Male Infertility

Background: Male infertility has many causes, including genetic infertility. The NOP2/Sun domain family, member7 (Nsun7) gene, which encodes putative methyltransferase Nsun7, has a role in sperm motility. The aim of the present study was to investigate the effect of the T26248G polymorphism on Nsun7 protein function and its role in male infertility. Materials and Methods: Semen samples were col...

متن کامل

Biological Applications of Isothermal Titration Calorimetry

     Most of the biological phenomena are influenced by intermolecular recognition and interaction. Thus, understanding the thermodynamics of biomacromolecule ligand interaction is a very interesting area in biochemistry and biotechnology. One of the most powerful techniques to obtain precise information about the energetics of (bio) molecules binding to other biological macromolecules is isoth...

متن کامل

Conserved Core Substructures in the Overlay of Protein-Ligand Complexes

The method of conserved core substructure matching (CSM) for the overlay of protein-ligand complexes is described. The method relies upon distance geometry to align structurally similar substructures without regard to sequence similarity onto substructures from a reference protein empirically selected to include key determinants of binding site location and geometry. The error in ligand positio...

متن کامل

Biochemical Aspects of Protein Changes in Seed Physiology and Germination

Seed storage proteins are synthesized as sources of carbon, nitrogen and sulfur for the next generation of plants. Reactive oxygen species serve as second messengers for signal transduction; however, molecular targets of oxidant signaling have not been defined. Here, many researchers showes that ligand–receptor mediated signaling promotes reactive oxygen species– dependent protein carbonylation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره 23  شماره 

صفحات  -

تاریخ انتشار 2018